Dataset statistics
| Number of variables | 18 |
|---|---|
| Number of observations | 45466 |
| Missing cells | 64196 |
| Missing cells (%) | 7.8% |
| Duplicate rows | 16 |
| Duplicate rows (%) | < 0.1% |
| Total size in memory | 5.9 MiB |
| Average record size in memory | 137.0 B |
Variable types
| Boolean | 2 |
|---|---|
| Numeric | 6 |
| Text | 8 |
| DateTime | 1 |
| Categorical | 1 |
| Dataset has 16 (< 0.1%) duplicate rows | Duplicates |
adult is highly imbalanced (99.7%) | Imbalance |
status is highly imbalanced (97.0%) | Imbalance |
video is highly imbalanced (97.9%) | Imbalance |
homepage has 37684 (82.9%) missing values | Missing |
overview has 954 (2.1%) missing values | Missing |
tagline has 25054 (55.1%) missing values | Missing |
popularity is highly skewed (γ1 = 29.22545384) | Skewed |
budget has 36573 (80.4%) zeros | Zeros |
revenue has 38052 (83.7%) zeros | Zeros |
runtime has 1558 (3.4%) zeros | Zeros |
vote_average has 2998 (6.6%) zeros | Zeros |
vote_count has 2899 (6.4%) zeros | Zeros |
Reproduction
| Analysis started | 2024-04-25 21:21:00.433713 |
|---|---|
| Analysis finished | 2024-04-25 21:21:05.736157 |
| Duration | 5.3 seconds |
| Software version | ydata-profiling vv4.7.0 |
| Download configuration | config.json |
adult
Boolean
IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 3 |
| Missing (%) | < 0.1% |
| Memory size | 355.3 KiB |
| False | |
|---|---|
| True | 9 |
| (Missing) | 3 |
| Value | Count | Frequency (%) |
| False | 45454 | |
| True | 9 | < 0.1% |
| (Missing) | 3 | < 0.1% |
budget
Real number (ℝ)
ZEROS 
| Distinct | 1223 |
|---|---|
| Distinct (%) | 2.7% |
| Missing | 3 |
| Missing (%) | < 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4224578.8 |
| Minimum | 0 |
|---|---|
| Maximum | 3.8 × 108 |
| Zeros | 36573 |
| Zeros (%) | 80.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 355.3 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 25000000 |
| Maximum | 3.8 × 108 |
| Range | 3.8 × 108 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 17424133 |
|---|---|
| Coefficient of variation (CV) | 4.1244662 |
| Kurtosis | 66.765616 |
| Mean | 4224578.8 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 7.125326 |
| Sum | 1.9206203 × 1011 |
| Variance | 3.036004 × 1014 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 36573 | |
| 5000000 | 286 | 0.6% |
| 10000000 | 259 | 0.6% |
| 20000000 | 243 | 0.5% |
| 2000000 | 242 | 0.5% |
| 15000000 | 226 | 0.5% |
| 3000000 | 223 | 0.5% |
| 25000000 | 206 | 0.5% |
| 1000000 | 197 | 0.4% |
| 30000000 | 190 | 0.4% |
| Other values (1213) | 6818 | 15.0% |
| Value | Count | Frequency (%) |
| 0 | 36573 | |
| 1 | 25 | 0.1% |
| 2 | 14 | < 0.1% |
| 3 | 9 | < 0.1% |
| 4 | 8 | < 0.1% |
| 5 | 8 | < 0.1% |
| 6 | 5 | < 0.1% |
| 7 | 4 | < 0.1% |
| 8 | 5 | < 0.1% |
| 9 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 380000000 | 1 | < 0.1% |
| 300000000 | 1 | < 0.1% |
| 280000000 | 1 | < 0.1% |
| 270000000 | 1 | < 0.1% |
| 260000000 | 3 | < 0.1% |
| 258000000 | 1 | < 0.1% |
| 255000000 | 1 | < 0.1% |
| 250000000 | 10 | |
| 245000000 | 2 | < 0.1% |
| 237000000 | 1 | < 0.1% |
homepage
Text
MISSING 
| Distinct | 7673 |
|---|---|
| Distinct (%) | 98.6% |
| Missing | 37684 |
| Missing (%) | 82.9% |
| Memory size | 355.3 KiB |
Length
| Max length | 242 |
|---|---|
| Median length | 110 |
| Mean length | 36.712799 |
| Min length | 13 |
Characters and Unicode
| Total characters | 285699 |
|---|---|
| Distinct characters | 91 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 7610 ? |
|---|---|
| Unique (%) | 97.8% |
Sample
| 1st row | http://toystory.disney.com/toy-story |
|---|---|
| 2nd row | http://www.mgm.com/view/movie/757/Goldeneye/ |
| 3rd row | http://www.mgm.com/title_title.do?title_star=LEAVINGL |
| 4th row | http://www.sevenmovie.com/ |
| 5th row | http://www.mgm.com/#/our-titles/2083/The-Usual-Suspects |
| Value | Count | Frequency (%) |
| http://www.georgecarlin.com | 12 | 0.2% |
| iso_3166_1 | 7 | 0.1% |
| name | 7 | 0.1% |
| http://www.wernerherzog.com/films-by.html | 7 | 0.1% |
| http://breakblade.jp | 6 | 0.1% |
| http://www.kungfupanda.com | 6 | 0.1% |
| http://www.transformersmovie.com | 5 | 0.1% |
| http://www.missionimpossible.com | 5 | 0.1% |
| http://www.crownintlpictures.com/tztitles.html | 4 | 0.1% |
| http://www.jeffdunham.com | 4 | 0.1% |
| Other values (7658) | 7753 |
Most occurring characters
| Value | Count | Frequency (%) |
| t | 25849 | 9.0% |
| / | 25820 | 9.0% |
| w | 19516 | 6.8% |
| o | 18783 | 6.6% |
| e | 18709 | 6.5% |
| . | 15387 | 5.4% |
| m | 15101 | 5.3% |
| h | 13863 | 4.9% |
| i | 13654 | 4.8% |
| c | 11414 | 4.0% |
| Other values (81) | 107603 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 285699 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| t | 25849 | 9.0% |
| / | 25820 | 9.0% |
| w | 19516 | 6.8% |
| o | 18783 | 6.6% |
| e | 18709 | 6.5% |
| . | 15387 | 5.4% |
| m | 15101 | 5.3% |
| h | 13863 | 4.9% |
| i | 13654 | 4.8% |
| c | 11414 | 4.0% |
| Other values (81) | 107603 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 285699 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| t | 25849 | 9.0% |
| / | 25820 | 9.0% |
| w | 19516 | 6.8% |
| o | 18783 | 6.6% |
| e | 18709 | 6.5% |
| . | 15387 | 5.4% |
| m | 15101 | 5.3% |
| h | 13863 | 4.9% |
| i | 13654 | 4.8% |
| c | 11414 | 4.0% |
| Other values (81) | 107603 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 285699 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| t | 25849 | 9.0% |
| / | 25820 | 9.0% |
| w | 19516 | 6.8% |
| o | 18783 | 6.6% |
| e | 18709 | 6.5% |
| . | 15387 | 5.4% |
| m | 15101 | 5.3% |
| h | 13863 | 4.9% |
| i | 13654 | 4.8% |
| c | 11414 | 4.0% |
| Other values (81) | 107603 |
id
Text
| Distinct | 45436 |
|---|---|
| Distinct (%) | 99.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 355.3 KiB |
Length
| Max length | 10 |
|---|---|
| Median length | 5 |
| Mean length | 5.2514846 |
| Min length | 1 |
Characters and Unicode
| Total characters | 238764 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 45407 ? |
|---|---|
| Unique (%) | 99.9% |
Sample
| 1st row | 862 |
|---|---|
| 2nd row | 8844 |
| 3rd row | 15602 |
| 4th row | 31357 |
| 5th row | 11862 |
| Value | Count | Frequency (%) |
| 141971 | 3 | < 0.1% |
| 12600 | 2 | < 0.1% |
| 109962 | 2 | < 0.1% |
| 69234 | 2 | < 0.1% |
| 5511 | 2 | < 0.1% |
| 159849 | 2 | < 0.1% |
| 25541 | 2 | < 0.1% |
| 42495 | 2 | < 0.1% |
| 298721 | 2 | < 0.1% |
| 14788 | 2 | < 0.1% |
| Other values (45426) | 45445 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 32923 | |
| 2 | 28625 | |
| 3 | 26732 | |
| 4 | 24747 | |
| 5 | 21996 | |
| 6 | 21184 | |
| 7 | 20949 | |
| 8 | 20909 | |
| 9 | 20485 | |
| 0 | 20208 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 238764 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 1 | 32923 | |
| 2 | 28625 | |
| 3 | 26732 | |
| 4 | 24747 | |
| 5 | 21996 | |
| 6 | 21184 | |
| 7 | 20949 | |
| 8 | 20909 | |
| 9 | 20485 | |
| 0 | 20208 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 238764 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 1 | 32923 | |
| 2 | 28625 | |
| 3 | 26732 | |
| 4 | 24747 | |
| 5 | 21996 | |
| 6 | 21184 | |
| 7 | 20949 | |
| 8 | 20909 | |
| 9 | 20485 | |
| 0 | 20208 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 238764 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 1 | 32923 | |
| 2 | 28625 | |
| 3 | 26732 | |
| 4 | 24747 | |
| 5 | 21996 | |
| 6 | 21184 | |
| 7 | 20949 | |
| 8 | 20909 | |
| 9 | 20485 | |
| 0 | 20208 |
imdb_id
Text
| Distinct | 45417 |
|---|---|
| Distinct (%) | 99.9% |
| Missing | 17 |
| Missing (%) | < 0.1% |
| Memory size | 355.3 KiB |
Length
| Max length | 9 |
|---|---|
| Median length | 9 |
| Mean length | 8.9994719 |
| Min length | 1 |
Characters and Unicode
| Total characters | 409017 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 45387 ? |
|---|---|
| Unique (%) | 99.9% |
Sample
| 1st row | tt0114709 |
|---|---|
| 2nd row | tt0113497 |
| 3rd row | tt0113228 |
| 4th row | tt0114885 |
| 5th row | tt0113041 |
| Value | Count | Frequency (%) |
| tt1180333 | 3 | < 0.1% |
| 0 | 3 | < 0.1% |
| tt0295682 | 2 | < 0.1% |
| tt0100361 | 2 | < 0.1% |
| tt1821641 | 2 | < 0.1% |
| tt0062229 | 2 | < 0.1% |
| tt0173769 | 2 | < 0.1% |
| tt1327820 | 2 | < 0.1% |
| tt0022879 | 2 | < 0.1% |
| tt0111613 | 2 | < 0.1% |
| Other values (45407) | 45427 |
Most occurring characters
| Value | Count | Frequency (%) |
| t | 90892 | |
| 0 | 69913 | |
| 1 | 37232 | |
| 2 | 31234 | 7.6% |
| 4 | 28498 | 7.0% |
| 3 | 28135 | 6.9% |
| 8 | 25445 | 6.2% |
| 6 | 25442 | 6.2% |
| 5 | 24253 | 5.9% |
| 7 | 24221 | 5.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 409017 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| t | 90892 | |
| 0 | 69913 | |
| 1 | 37232 | |
| 2 | 31234 | 7.6% |
| 4 | 28498 | 7.0% |
| 3 | 28135 | 6.9% |
| 8 | 25445 | 6.2% |
| 6 | 25442 | 6.2% |
| 5 | 24253 | 5.9% |
| 7 | 24221 | 5.9% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 409017 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| t | 90892 | |
| 0 | 69913 | |
| 1 | 37232 | |
| 2 | 31234 | 7.6% |
| 4 | 28498 | 7.0% |
| 3 | 28135 | 6.9% |
| 8 | 25445 | 6.2% |
| 6 | 25442 | 6.2% |
| 5 | 24253 | 5.9% |
| 7 | 24221 | 5.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 409017 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| t | 90892 | |
| 0 | 69913 | |
| 1 | 37232 | |
| 2 | 31234 | 7.6% |
| 4 | 28498 | 7.0% |
| 3 | 28135 | 6.9% |
| 8 | 25445 | 6.2% |
| 6 | 25442 | 6.2% |
| 5 | 24253 | 5.9% |
| 7 | 24221 | 5.9% |
| Distinct | 92 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 11 |
| Missing (%) | < 0.1% |
| Memory size | 355.3 KiB |
Length
| Max length | 5 |
|---|---|
| Median length | 2 |
| Mean length | 2.000154 |
| Min length | 2 |
Characters and Unicode
| Total characters | 90917 |
|---|---|
| Distinct characters | 33 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 20 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | en |
|---|---|
| 2nd row | en |
| 3rd row | en |
| 4th row | en |
| 5th row | en |
| Value | Count | Frequency (%) |
| en | 32269 | |
| fr | 2438 | 5.4% |
| it | 1529 | 3.4% |
| ja | 1350 | 3.0% |
| de | 1080 | 2.4% |
| es | 994 | 2.2% |
| ru | 826 | 1.8% |
| hi | 508 | 1.1% |
| ko | 444 | 1.0% |
| zh | 409 | 0.9% |
| Other values (82) | 3608 | 7.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 34598 | |
| n | 32978 | |
| r | 3636 | 4.0% |
| f | 2839 | 3.1% |
| i | 2391 | 2.6% |
| t | 2252 | 2.5% |
| a | 1841 | 2.0% |
| s | 1654 | 1.8% |
| j | 1351 | 1.5% |
| d | 1325 | 1.5% |
| Other values (23) | 6052 | 6.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 90917 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| e | 34598 | |
| n | 32978 | |
| r | 3636 | 4.0% |
| f | 2839 | 3.1% |
| i | 2391 | 2.6% |
| t | 2252 | 2.5% |
| a | 1841 | 2.0% |
| s | 1654 | 1.8% |
| j | 1351 | 1.5% |
| d | 1325 | 1.5% |
| Other values (23) | 6052 | 6.7% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 90917 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| e | 34598 | |
| n | 32978 | |
| r | 3636 | 4.0% |
| f | 2839 | 3.1% |
| i | 2391 | 2.6% |
| t | 2252 | 2.5% |
| a | 1841 | 2.0% |
| s | 1654 | 1.8% |
| j | 1351 | 1.5% |
| d | 1325 | 1.5% |
| Other values (23) | 6052 | 6.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 90917 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| e | 34598 | |
| n | 32978 | |
| r | 3636 | 4.0% |
| f | 2839 | 3.1% |
| i | 2391 | 2.6% |
| t | 2252 | 2.5% |
| a | 1841 | 2.0% |
| s | 1654 | 1.8% |
| j | 1351 | 1.5% |
| d | 1325 | 1.5% |
| Other values (23) | 6052 | 6.7% |
original_title
Text
| Distinct | 43373 |
|---|---|
| Distinct (%) | 95.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 355.3 KiB |
Length
| Max length | 109 |
|---|---|
| Median length | 84 |
| Mean length | 16.323494 |
| Min length | 1 |
Characters and Unicode
| Total characters | 742164 |
|---|---|
| Distinct characters | 2946 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 41712 ? |
|---|---|
| Unique (%) | 91.7% |
Sample
| 1st row | Toy Story |
|---|---|
| 2nd row | Jumanji |
| 3rd row | Grumpier Old Men |
| 4th row | Waiting to Exhale |
| 5th row | Father of the Bride Part II |
| Value | Count | Frequency (%) |
| the | 10261 | 7.8% |
| of | 3309 | 2.5% |
| a | 1674 | 1.3% |
| in | 1275 | 1.0% |
| and | 1072 | 0.8% |
| la | 1007 | 0.8% |
| 863 | 0.7% | |
| to | 806 | 0.6% |
| de | 702 | 0.5% |
| man | 509 | 0.4% |
| Other values (35324) | 110301 |
Most occurring characters
| Value | Count | Frequency (%) |
| 86293 | 11.6% | |
| e | 70665 | 9.5% |
| a | 49100 | 6.6% |
| o | 42066 | 5.7% |
| i | 39494 | 5.3% |
| n | 39149 | 5.3% |
| r | 37728 | 5.1% |
| t | 33530 | 4.5% |
| s | 28615 | 3.9% |
| l | 25557 | 3.4% |
| Other values (2936) | 289967 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 742164 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 86293 | 11.6% | |
| e | 70665 | 9.5% |
| a | 49100 | 6.6% |
| o | 42066 | 5.7% |
| i | 39494 | 5.3% |
| n | 39149 | 5.3% |
| r | 37728 | 5.1% |
| t | 33530 | 4.5% |
| s | 28615 | 3.9% |
| l | 25557 | 3.4% |
| Other values (2936) | 289967 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 742164 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 86293 | 11.6% | |
| e | 70665 | 9.5% |
| a | 49100 | 6.6% |
| o | 42066 | 5.7% |
| i | 39494 | 5.3% |
| n | 39149 | 5.3% |
| r | 37728 | 5.1% |
| t | 33530 | 4.5% |
| s | 28615 | 3.9% |
| l | 25557 | 3.4% |
| Other values (2936) | 289967 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 742164 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 86293 | 11.6% | |
| e | 70665 | 9.5% |
| a | 49100 | 6.6% |
| o | 42066 | 5.7% |
| i | 39494 | 5.3% |
| n | 39149 | 5.3% |
| r | 37728 | 5.1% |
| t | 33530 | 4.5% |
| s | 28615 | 3.9% |
| l | 25557 | 3.4% |
| Other values (2936) | 289967 |
overview
Text
MISSING 
| Distinct | 44307 |
|---|---|
| Distinct (%) | 99.5% |
| Missing | 954 |
| Missing (%) | 2.1% |
| Memory size | 355.3 KiB |
Length
| Max length | 1000 |
|---|---|
| Median length | 785 |
| Mean length | 323.32155 |
| Min length | 1 |
Characters and Unicode
| Total characters | 14391689 |
|---|---|
| Distinct characters | 429 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 44247 ? |
|---|---|
| Unique (%) | 99.4% |
Sample
| 1st row | Led by Woody, Andy's toys live happily in his room until Andy's birthday brings Buzz Lightyear onto the scene. Afraid of losing his place in Andy's heart, Woody plots against Buzz. But when circumstances separate Buzz and Woody from their owner, the duo eventually learns to put aside their differences. |
|---|---|
| 2nd row | When siblings Judy and Peter discover an enchanted board game that opens the door to a magical world, they unwittingly invite Alan -- an adult who's been trapped inside the game for 26 years -- into their living room. Alan's only hope for freedom is to finish the game, which proves risky as all three find themselves running from giant rhinoceroses, evil monkeys and other terrifying creatures. |
| 3rd row | A family wedding reignites the ancient feud between next-door neighbors and fishing buddies John and Max. Meanwhile, a sultry Italian divorcée opens a restaurant at the local bait shop, alarming the locals who worry she'll scare the fish away. But she's less interested in seafood than she is in cooking up a hot time with Max. |
| 4th row | Cheated on, mistreated and stepped on, the women are holding their breath, waiting for the elusive "good man" to break a string of less-than-stellar lovers. Friends and confidants Vannah, Bernie, Glo and Robin talk it all out, determined to find a better way to breathe. |
| 5th row | Just when George Banks has recovered from his daughter's wedding, he receives the news that she's pregnant ... and that George's wife, Nina, is expecting too. He was planning on selling their home, but that's a plan that -- like George -- will have to change with the arrival of both a grandchild and a kid of his own. |
| Value | Count | Frequency (%) |
| the | 138357 | 5.6% |
| a | 99037 | 4.0% |
| and | 75407 | 3.1% |
| to | 73442 | 3.0% |
| of | 69723 | 2.8% |
| in | 48228 | 2.0% |
| is | 36550 | 1.5% |
| his | 36210 | 1.5% |
| with | 23933 | 1.0% |
| her | 21518 | 0.9% |
| Other values (97181) | 1830623 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2410599 | ||
| e | 1366183 | 9.5% |
| a | 942278 | 6.5% |
| t | 936476 | 6.5% |
| i | 853105 | 5.9% |
| o | 831419 | 5.8% |
| n | 824147 | 5.7% |
| s | 769188 | 5.3% |
| r | 745638 | 5.2% |
| h | 601821 | 4.2% |
| Other values (419) | 4110835 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 14391689 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 2410599 | ||
| e | 1366183 | 9.5% |
| a | 942278 | 6.5% |
| t | 936476 | 6.5% |
| i | 853105 | 5.9% |
| o | 831419 | 5.8% |
| n | 824147 | 5.7% |
| s | 769188 | 5.3% |
| r | 745638 | 5.2% |
| h | 601821 | 4.2% |
| Other values (419) | 4110835 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 14391689 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 2410599 | ||
| e | 1366183 | 9.5% |
| a | 942278 | 6.5% |
| t | 936476 | 6.5% |
| i | 853105 | 5.9% |
| o | 831419 | 5.8% |
| n | 824147 | 5.7% |
| s | 769188 | 5.3% |
| r | 745638 | 5.2% |
| h | 601821 | 4.2% |
| Other values (419) | 4110835 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 14391689 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 2410599 | ||
| e | 1366183 | 9.5% |
| a | 942278 | 6.5% |
| t | 936476 | 6.5% |
| i | 853105 | 5.9% |
| o | 831419 | 5.8% |
| n | 824147 | 5.7% |
| s | 769188 | 5.3% |
| r | 745638 | 5.2% |
| h | 601821 | 4.2% |
| Other values (419) | 4110835 |
popularity
Real number (ℝ)
SKEWED 
| Distinct | 43757 |
|---|---|
| Distinct (%) | 96.3% |
| Missing | 6 |
| Missing (%) | < 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.9214783 |
| Minimum | 0 |
|---|---|
| Maximum | 547.4883 |
| Zeros | 66 |
| Zeros (%) | 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 355.3 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0.018921 |
| Q1 | 0.38594775 |
| median | 1.127685 |
| Q3 | 3.6789023 |
| 95-th percentile | 11.061568 |
| Maximum | 547.4883 |
| Range | 547.4883 |
| Interquartile range (IQR) | 3.2929545 |
Descriptive statistics
| Standard deviation | 6.0054143 |
|---|---|
| Coefficient of variation (CV) | 2.055608 |
| Kurtosis | 1925.684 |
| Mean | 2.9214783 |
| Median Absolute Deviation (MAD) | 0.9672565 |
| Skewness | 29.225454 |
| Sum | 132810.41 |
| Variance | 36.065001 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 66 | 0.1% |
| 1 × 10-6 | 56 | 0.1% |
| 0.000308 | 43 | 0.1% |
| 0.00022 | 40 | 0.1% |
| 0.000578 | 38 | 0.1% |
| 0.001177 | 38 | 0.1% |
| 0.000844 | 38 | 0.1% |
| 0.002001 | 28 | 0.1% |
| 0.003013 | 21 | < 0.1% |
| 0.00353 | 19 | < 0.1% |
| Other values (43747) | 45073 |
| Value | Count | Frequency (%) |
| 0 | 66 | |
| 1 × 10-6 | 56 | |
| 2 × 10-6 | 6 | < 0.1% |
| 3 × 10-6 | 6 | < 0.1% |
| 4 × 10-6 | 5 | < 0.1% |
| 5 × 10-6 | 1 | < 0.1% |
| 6 × 10-6 | 4 | < 0.1% |
| 7 × 10-6 | 1 | < 0.1% |
| 8 × 10-6 | 6 | < 0.1% |
| 9 × 10-6 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 547.488298 | 1 | |
| 294.337037 | 1 | |
| 287.253654 | 1 | |
| 228.032744 | 1 | |
| 213.849907 | 1 | |
| 187.860492 | 1 | |
| 185.330992 | 1 | |
| 185.070892 | 1 | |
| 183.870374 | 1 | |
| 154.801009 | 1 |
release_date
Date
| Distinct | 17333 |
|---|---|
| Distinct (%) | 38.2% |
| Missing | 90 |
| Missing (%) | 0.2% |
| Memory size | 355.3 KiB |
| Minimum | 1874-12-09 00:00:00 |
|---|---|
| Maximum | 2020-12-16 00:00:00 |
revenue
Real number (ℝ)
ZEROS 
| Distinct | 6863 |
|---|---|
| Distinct (%) | 15.1% |
| Missing | 6 |
| Missing (%) | < 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 11209349 |
| Minimum | 0 |
|---|---|
| Maximum | 2.7879651 × 109 |
| Zeros | 38052 |
| Zeros (%) | 83.7% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 355.3 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 47808918 |
| Maximum | 2.7879651 × 109 |
| Range | 2.7879651 × 109 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 64332247 |
|---|---|
| Coefficient of variation (CV) | 5.7391602 |
| Kurtosis | 237.51059 |
| Mean | 11209349 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 12.265983 |
| Sum | 5.0957698 × 1011 |
| Variance | 4.138638 × 1015 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 38052 | |
| 12000000 | 20 | < 0.1% |
| 11000000 | 19 | < 0.1% |
| 10000000 | 19 | < 0.1% |
| 2000000 | 18 | < 0.1% |
| 6000000 | 17 | < 0.1% |
| 5000000 | 14 | < 0.1% |
| 8000000 | 13 | < 0.1% |
| 500000 | 13 | < 0.1% |
| 1 | 12 | < 0.1% |
| Other values (6853) | 7263 | 16.0% |
| Value | Count | Frequency (%) |
| 0 | 38052 | |
| 1 | 12 | < 0.1% |
| 2 | 3 | < 0.1% |
| 3 | 9 | < 0.1% |
| 4 | 4 | < 0.1% |
| 5 | 5 | < 0.1% |
| 6 | 2 | < 0.1% |
| 7 | 4 | < 0.1% |
| 8 | 5 | < 0.1% |
| 9 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 2787965087 | 1 | |
| 2068223624 | 1 | |
| 1845034188 | 1 | |
| 1519557910 | 1 | |
| 1513528810 | 1 | |
| 1506249360 | 1 | |
| 1405403694 | 1 | |
| 1342000000 | 1 | |
| 1274219009 | 1 | |
| 1262886337 | 1 |
runtime
Real number (ℝ)
ZEROS 
| Distinct | 353 |
|---|---|
| Distinct (%) | 0.8% |
| Missing | 263 |
| Missing (%) | 0.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 94.128199 |
| Minimum | 0 |
|---|---|
| Maximum | 1256 |
| Zeros | 1558 |
| Zeros (%) | 3.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 355.3 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 11 |
| Q1 | 85 |
| median | 95 |
| Q3 | 107 |
| 95-th percentile | 138 |
| Maximum | 1256 |
| Range | 1256 |
| Interquartile range (IQR) | 22 |
Descriptive statistics
| Standard deviation | 38.40781 |
|---|---|
| Coefficient of variation (CV) | 0.40803724 |
| Kurtosis | 93.217158 |
| Mean | 94.128199 |
| Median Absolute Deviation (MAD) | 11 |
| Skewness | 4.4659579 |
| Sum | 4254877 |
| Variance | 1475.1599 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 90 | 2556 | 5.6% |
| 0 | 1558 | 3.4% |
| 100 | 1470 | 3.2% |
| 95 | 1412 | 3.1% |
| 93 | 1214 | 2.7% |
| 96 | 1104 | 2.4% |
| 92 | 1080 | 2.4% |
| 94 | 1062 | 2.3% |
| 91 | 1057 | 2.3% |
| 88 | 1032 | 2.3% |
| Other values (343) | 31658 |
| Value | Count | Frequency (%) |
| 0 | 1558 | |
| 1 | 107 | 0.2% |
| 2 | 33 | 0.1% |
| 3 | 48 | 0.1% |
| 4 | 51 | 0.1% |
| 5 | 51 | 0.1% |
| 6 | 72 | 0.2% |
| 7 | 103 | 0.2% |
| 8 | 78 | 0.2% |
| 9 | 63 | 0.1% |
| Value | Count | Frequency (%) |
| 1256 | 1 | |
| 1140 | 2 | |
| 931 | 1 | |
| 925 | 1 | |
| 900 | 1 | |
| 877 | 1 | |
| 874 | 1 | |
| 840 | 2 | |
| 780 | 1 | |
| 720 | 1 |
status
Categorical
IMBALANCE 
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 87 |
| Missing (%) | 0.2% |
| Memory size | 355.3 KiB |
| Released | |
|---|---|
| Rumored | 230 |
| Post Production | 98 |
| In Production | 20 |
| Planned | 15 |
Length
| Max length | 15 |
|---|---|
| Median length | 8 |
| Mean length | 8.0119218 |
| Min length | 7 |
Characters and Unicode
| Total characters | 363573 |
|---|---|
| Distinct characters | 18 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Released |
|---|---|
| 2nd row | Released |
| 3rd row | Released |
| 4th row | Released |
| 5th row | Released |
Common Values
| Value | Count | Frequency (%) |
| Released | 45014 | |
| Rumored | 230 | 0.5% |
| Post Production | 98 | 0.2% |
| In Production | 20 | < 0.1% |
| Planned | 15 | < 0.1% |
| Canceled | 2 | < 0.1% |
| (Missing) | 87 | 0.2% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| released | 45014 | |
| rumored | 230 | 0.5% |
| production | 118 | 0.3% |
| post | 98 | 0.2% |
| in | 20 | < 0.1% |
| planned | 15 | < 0.1% |
| canceled | 2 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 135291 | |
| d | 45379 | 12.5% |
| R | 45244 | 12.4% |
| s | 45112 | 12.4% |
| l | 45031 | 12.4% |
| a | 45031 | 12.4% |
| o | 564 | 0.2% |
| r | 348 | 0.1% |
| u | 348 | 0.1% |
| P | 231 | 0.1% |
| Other values (8) | 994 | 0.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 363573 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| e | 135291 | |
| d | 45379 | 12.5% |
| R | 45244 | 12.4% |
| s | 45112 | 12.4% |
| l | 45031 | 12.4% |
| a | 45031 | 12.4% |
| o | 564 | 0.2% |
| r | 348 | 0.1% |
| u | 348 | 0.1% |
| P | 231 | 0.1% |
| Other values (8) | 994 | 0.3% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 363573 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| e | 135291 | |
| d | 45379 | 12.5% |
| R | 45244 | 12.4% |
| s | 45112 | 12.4% |
| l | 45031 | 12.4% |
| a | 45031 | 12.4% |
| o | 564 | 0.2% |
| r | 348 | 0.1% |
| u | 348 | 0.1% |
| P | 231 | 0.1% |
| Other values (8) | 994 | 0.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 363573 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| e | 135291 | |
| d | 45379 | 12.5% |
| R | 45244 | 12.4% |
| s | 45112 | 12.4% |
| l | 45031 | 12.4% |
| a | 45031 | 12.4% |
| o | 564 | 0.2% |
| r | 348 | 0.1% |
| u | 348 | 0.1% |
| P | 231 | 0.1% |
| Other values (8) | 994 | 0.3% |
tagline
Text
MISSING 
| Distinct | 20283 |
|---|---|
| Distinct (%) | 99.4% |
| Missing | 25054 |
| Missing (%) | 55.1% |
| Memory size | 355.3 KiB |
Length
| Max length | 297 |
|---|---|
| Median length | 204 |
| Mean length | 47.002841 |
| Min length | 1 |
Characters and Unicode
| Total characters | 959422 |
|---|---|
| Distinct characters | 170 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 20177 ? |
|---|---|
| Unique (%) | 98.8% |
Sample
| 1st row | Roll the dice and unleash the excitement! |
|---|---|
| 2nd row | Still Yelling. Still Fighting. Still Ready for Love. |
| 3rd row | Friends are the people who let you be yourself... and never let you forget it. |
| 4th row | Just When His World Is Back To Normal... He's In For The Surprise Of His Life! |
| 5th row | A Los Angeles Crime Saga |
| Value | Count | Frequency (%) |
| the | 11004 | 6.3% |
| a | 6820 | 3.9% |
| of | 4406 | 2.5% |
| to | 3586 | 2.1% |
| is | 2800 | 1.6% |
| in | 2693 | 1.5% |
| and | 2686 | 1.5% |
| you | 2389 | 1.4% |
| 1585 | 0.9% | |
| for | 1524 | 0.9% |
| Other values (15108) | 134566 |
Most occurring characters
| Value | Count | Frequency (%) |
| 153795 | ||
| e | 94486 | 9.8% |
| t | 57309 | 6.0% |
| o | 56611 | 5.9% |
| a | 51521 | 5.4% |
| n | 47539 | 5.0% |
| i | 46086 | 4.8% |
| r | 45029 | 4.7% |
| s | 42399 | 4.4% |
| h | 37192 | 3.9% |
| Other values (160) | 327455 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 959422 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 153795 | ||
| e | 94486 | 9.8% |
| t | 57309 | 6.0% |
| o | 56611 | 5.9% |
| a | 51521 | 5.4% |
| n | 47539 | 5.0% |
| i | 46086 | 4.8% |
| r | 45029 | 4.7% |
| s | 42399 | 4.4% |
| h | 37192 | 3.9% |
| Other values (160) | 327455 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 959422 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 153795 | ||
| e | 94486 | 9.8% |
| t | 57309 | 6.0% |
| o | 56611 | 5.9% |
| a | 51521 | 5.4% |
| n | 47539 | 5.0% |
| i | 46086 | 4.8% |
| r | 45029 | 4.7% |
| s | 42399 | 4.4% |
| h | 37192 | 3.9% |
| Other values (160) | 327455 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 959422 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 153795 | ||
| e | 94486 | 9.8% |
| t | 57309 | 6.0% |
| o | 56611 | 5.9% |
| a | 51521 | 5.4% |
| n | 47539 | 5.0% |
| i | 46086 | 4.8% |
| r | 45029 | 4.7% |
| s | 42399 | 4.4% |
| h | 37192 | 3.9% |
| Other values (160) | 327455 |
title
Text
| Distinct | 42277 |
|---|---|
| Distinct (%) | 93.0% |
| Missing | 6 |
| Missing (%) | < 0.1% |
| Memory size | 355.3 KiB |
Length
| Max length | 105 |
|---|---|
| Median length | 79 |
| Mean length | 16.708535 |
| Min length | 1 |
Characters and Unicode
| Total characters | 759570 |
|---|---|
| Distinct characters | 287 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 39947 ? |
|---|---|
| Unique (%) | 87.9% |
Sample
| 1st row | Toy Story |
|---|---|
| 2nd row | Jumanji |
| 3rd row | Grumpier Old Men |
| 4th row | Waiting to Exhale |
| 5th row | Father of the Bride Part II |
| Value | Count | Frequency (%) |
| the | 14571 | 10.7% |
| of | 4938 | 3.6% |
| a | 2244 | 1.6% |
| in | 1697 | 1.2% |
| and | 1634 | 1.2% |
| to | 1055 | 0.8% |
| 763 | 0.6% | |
| man | 665 | 0.5% |
| love | 664 | 0.5% |
| for | 602 | 0.4% |
| Other values (24431) | 107634 |
Most occurring characters
| Value | Count | Frequency (%) |
| 91029 | 12.0% | |
| e | 76408 | 10.1% |
| a | 49056 | 6.5% |
| o | 45765 | 6.0% |
| n | 40931 | 5.4% |
| r | 40096 | 5.3% |
| i | 39859 | 5.2% |
| t | 36792 | 4.8% |
| s | 29591 | 3.9% |
| h | 28564 | 3.8% |
| Other values (277) | 281479 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 759570 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 91029 | 12.0% | |
| e | 76408 | 10.1% |
| a | 49056 | 6.5% |
| o | 45765 | 6.0% |
| n | 40931 | 5.4% |
| r | 40096 | 5.3% |
| i | 39859 | 5.2% |
| t | 36792 | 4.8% |
| s | 29591 | 3.9% |
| h | 28564 | 3.8% |
| Other values (277) | 281479 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 759570 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 91029 | 12.0% | |
| e | 76408 | 10.1% |
| a | 49056 | 6.5% |
| o | 45765 | 6.0% |
| n | 40931 | 5.4% |
| r | 40096 | 5.3% |
| i | 39859 | 5.2% |
| t | 36792 | 4.8% |
| s | 29591 | 3.9% |
| h | 28564 | 3.8% |
| Other values (277) | 281479 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 759570 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 91029 | 12.0% | |
| e | 76408 | 10.1% |
| a | 49056 | 6.5% |
| o | 45765 | 6.0% |
| n | 40931 | 5.4% |
| r | 40096 | 5.3% |
| i | 39859 | 5.2% |
| t | 36792 | 4.8% |
| s | 29591 | 3.9% |
| h | 28564 | 3.8% |
| Other values (277) | 281479 |
video
Boolean
IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 44.5 KiB |
| False | |
|---|---|
| True | 93 |
| Value | Count | Frequency (%) |
| False | 45373 | |
| True | 93 | 0.2% |
vote_average
Real number (ℝ)
ZEROS 
| Distinct | 92 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 6 |
| Missing (%) | < 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5.6182072 |
| Minimum | 0 |
|---|---|
| Maximum | 10 |
| Zeros | 2998 |
| Zeros (%) | 6.6% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 355.3 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 5 |
| median | 6 |
| Q3 | 6.8 |
| 95-th percentile | 7.8 |
| Maximum | 10 |
| Range | 10 |
| Interquartile range (IQR) | 1.8 |
Descriptive statistics
| Standard deviation | 1.924216 |
|---|---|
| Coefficient of variation (CV) | 0.34249644 |
| Kurtosis | 2.5004022 |
| Mean | 5.6182072 |
| Median Absolute Deviation (MAD) | 0.9 |
| Skewness | -1.5189901 |
| Sum | 255403.7 |
| Variance | 3.7026072 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 2998 | 6.6% |
| 6 | 2468 | 5.4% |
| 5 | 2001 | 4.4% |
| 7 | 1886 | 4.1% |
| 6.5 | 1722 | 3.8% |
| 6.3 | 1603 | 3.5% |
| 5.5 | 1381 | 3.0% |
| 5.8 | 1369 | 3.0% |
| 6.4 | 1350 | 3.0% |
| 6.7 | 1342 | 3.0% |
| Other values (82) | 27340 |
| Value | Count | Frequency (%) |
| 0 | 2998 | |
| 0.5 | 13 | < 0.1% |
| 0.7 | 1 | < 0.1% |
| 1 | 105 | 0.2% |
| 1.1 | 1 | < 0.1% |
| 1.2 | 4 | < 0.1% |
| 1.3 | 13 | < 0.1% |
| 1.4 | 5 | < 0.1% |
| 1.5 | 30 | 0.1% |
| 1.6 | 6 | < 0.1% |
| Value | Count | Frequency (%) |
| 10 | 190 | |
| 9.8 | 1 | < 0.1% |
| 9.6 | 1 | < 0.1% |
| 9.5 | 18 | < 0.1% |
| 9.4 | 3 | < 0.1% |
| 9.3 | 18 | < 0.1% |
| 9.2 | 4 | < 0.1% |
| 9.1 | 3 | < 0.1% |
| 9 | 159 | |
| 8.9 | 7 | < 0.1% |
vote_count
Real number (ℝ)
ZEROS 
| Distinct | 1820 |
|---|---|
| Distinct (%) | 4.0% |
| Missing | 6 |
| Missing (%) | < 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 109.89734 |
| Minimum | 0 |
|---|---|
| Maximum | 14075 |
| Zeros | 2899 |
| Zeros (%) | 6.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 355.3 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 3 |
| median | 10 |
| Q3 | 34 |
| 95-th percentile | 434 |
| Maximum | 14075 |
| Range | 14075 |
| Interquartile range (IQR) | 31 |
Descriptive statistics
| Standard deviation | 491.31037 |
|---|---|
| Coefficient of variation (CV) | 4.4706303 |
| Kurtosis | 151.2028 |
| Mean | 109.89734 |
| Median Absolute Deviation (MAD) | 8 |
| Skewness | 10.450232 |
| Sum | 4995933 |
| Variance | 241385.88 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 3264 | 7.2% |
| 2 | 3132 | 6.9% |
| 0 | 2899 | 6.4% |
| 3 | 2787 | 6.1% |
| 4 | 2480 | 5.5% |
| 5 | 2097 | 4.6% |
| 6 | 1747 | 3.8% |
| 7 | 1570 | 3.5% |
| 8 | 1359 | 3.0% |
| 9 | 1194 | 2.6% |
| Other values (1810) | 22931 |
| Value | Count | Frequency (%) |
| 0 | 2899 | |
| 1 | 3264 | |
| 2 | 3132 | |
| 3 | 2787 | |
| 4 | 2480 | |
| 5 | 2097 | |
| 6 | 1747 | |
| 7 | 1570 | |
| 8 | 1359 | |
| 9 | 1194 | 2.6% |
| Value | Count | Frequency (%) |
| 14075 | 1 | |
| 12269 | 1 | |
| 12114 | 1 | |
| 12000 | 1 | |
| 11444 | 1 | |
| 11187 | 1 | |
| 10297 | 1 | |
| 10014 | 1 | |
| 9678 | 1 | |
| 9634 | 1 |
| adult | budget | homepage | id | imdb_id | original_language | original_title | overview | popularity | release_date | revenue | runtime | status | tagline | title | video | vote_average | vote_count | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | False | 30000000.0 | http://toystory.disney.com/toy-story | 862 | tt0114709 | en | Toy Story | Led by Woody, Andy's toys live happily in his room until Andy's birthday brings Buzz Lightyear onto the scene. Afraid of losing his place in Andy's heart, Woody plots against Buzz. But when circumstances separate Buzz and Woody from their owner, the duo eventually learns to put aside their differences. | 21.946943 | 1995-10-30 | 373554033.0 | 81.0 | Released | NaN | Toy Story | False | 7.7 | 5415.0 |
| 1 | False | 65000000.0 | NaN | 8844 | tt0113497 | en | Jumanji | When siblings Judy and Peter discover an enchanted board game that opens the door to a magical world, they unwittingly invite Alan -- an adult who's been trapped inside the game for 26 years -- into their living room. Alan's only hope for freedom is to finish the game, which proves risky as all three find themselves running from giant rhinoceroses, evil monkeys and other terrifying creatures. | 17.015539 | 1995-12-15 | 262797249.0 | 104.0 | Released | Roll the dice and unleash the excitement! | Jumanji | False | 6.9 | 2413.0 |
| 2 | False | 0.0 | NaN | 15602 | tt0113228 | en | Grumpier Old Men | A family wedding reignites the ancient feud between next-door neighbors and fishing buddies John and Max. Meanwhile, a sultry Italian divorcée opens a restaurant at the local bait shop, alarming the locals who worry she'll scare the fish away. But she's less interested in seafood than she is in cooking up a hot time with Max. | 11.712900 | 1995-12-22 | 0.0 | 101.0 | Released | Still Yelling. Still Fighting. Still Ready for Love. | Grumpier Old Men | False | 6.5 | 92.0 |
| 3 | False | 16000000.0 | NaN | 31357 | tt0114885 | en | Waiting to Exhale | Cheated on, mistreated and stepped on, the women are holding their breath, waiting for the elusive "good man" to break a string of less-than-stellar lovers. Friends and confidants Vannah, Bernie, Glo and Robin talk it all out, determined to find a better way to breathe. | 3.859495 | 1995-12-22 | 81452156.0 | 127.0 | Released | Friends are the people who let you be yourself... and never let you forget it. | Waiting to Exhale | False | 6.1 | 34.0 |
| 4 | False | 0.0 | NaN | 11862 | tt0113041 | en | Father of the Bride Part II | Just when George Banks has recovered from his daughter's wedding, he receives the news that she's pregnant ... and that George's wife, Nina, is expecting too. He was planning on selling their home, but that's a plan that -- like George -- will have to change with the arrival of both a grandchild and a kid of his own. | 8.387519 | 1995-02-10 | 76578911.0 | 106.0 | Released | Just When His World Is Back To Normal... He's In For The Surprise Of His Life! | Father of the Bride Part II | False | 5.7 | 173.0 |
| 5 | False | 60000000.0 | NaN | 949 | tt0113277 | en | Heat | Obsessive master thief, Neil McCauley leads a top-notch crew on various insane heists throughout Los Angeles while a mentally unstable detective, Vincent Hanna pursues him without rest. Each man recognizes and respects the ability and the dedication of the other even though they are aware their cat-and-mouse game may end in violence. | 17.924927 | 1995-12-15 | 187436818.0 | 170.0 | Released | A Los Angeles Crime Saga | Heat | False | 7.7 | 1886.0 |
| 6 | False | 58000000.0 | NaN | 11860 | tt0114319 | en | Sabrina | An ugly duckling having undergone a remarkable change, still harbors feelings for her crush: a carefree playboy, but not before his business-focused brother has something to say about it. | 6.677277 | 1995-12-15 | 0.0 | 127.0 | Released | You are cordially invited to the most surprising merger of the year. | Sabrina | False | 6.2 | 141.0 |
| 7 | False | 0.0 | NaN | 45325 | tt0112302 | en | Tom and Huck | A mischievous young boy, Tom Sawyer, witnesses a murder by the deadly Injun Joe. Tom becomes friends with Huckleberry Finn, a boy with no future and no family. Tom has to choose between honoring a friendship or honoring an oath because the town alcoholic is accused of the murder. Tom and Huck go through several adventures trying to retrieve evidence. | 2.561161 | 1995-12-22 | 0.0 | 97.0 | Released | The Original Bad Boys. | Tom and Huck | False | 5.4 | 45.0 |
| 8 | False | 0.0 | NaN | 47686 | tt0119019 | en | Dream with the Fishes | Terry is a suicidal voyeur who treats a dying addict to a final binge, but Terry will only do this if he promises to kill him. | 0.684192 | 1997-01-01 | 0.0 | 97.0 | Released | An oddball odyssey about voyeurism, LSD and nude bowling! | Dream with the Fishes | False | 7.7 | 10.0 |
| 9 | False | 35000000.0 | NaN | 9091 | tt0114576 | en | Sudden Death | International action superstar Jean Claude Van Damme teams with Powers Boothe in a Tension-packed, suspense thriller, set against the back-drop of a Stanley Cup game.Van Damme portrays a father whose daughter is suddenly taken during a championship hockey game. With the captors demanding a billion dollars by game's end, Van Damme frantically sets a plan in motion to rescue his daughter and abort an impending explosion before the final buzzer... | 5.231580 | 1995-12-22 | 64350171.0 | 106.0 | Released | Terror goes into overtime. | Sudden Death | False | 5.5 | 174.0 |
| adult | budget | homepage | id | imdb_id | original_language | original_title | overview | popularity | release_date | revenue | runtime | status | tagline | title | video | vote_average | vote_count | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 45456 | False | 0.0 | NaN | 84419 | tt0038621 | en | House of Horrors | An unsuccessful sculptor saves a madman named "The Creeper" from drowning. Seeing an opportunity for revenge, he tricks the psycho into murdering his critics. | 0.222814 | 1946-03-29 | 0.0 | 65.0 | Released | Meet...The CREEPER! | House of Horrors | False | 6.3 | 8.0 |
| 45457 | False | 0.0 | NaN | 390959 | tt0265736 | en | Shadow of the Blair Witch | In this true-crime documentary, we delve into the murder spree that was the inspiration for Joe Berlinger's "Book of Shadows: Blair Witch 2". | 0.076061 | 2000-10-22 | 0.0 | 45.0 | Released | NaN | Shadow of the Blair Witch | False | 7.0 | 2.0 |
| 45458 | False | 0.0 | NaN | 289923 | tt0252966 | en | The Burkittsville 7 | A film archivist revisits the story of Rustin Parr, a hermit thought to have murdered seven children while under the possession of the Blair Witch. | 0.386450 | 2000-10-03 | 0.0 | 30.0 | Released | Do you know what happened 50 years before "The Blair Witch Project"? | The Burkittsville 7 | False | 7.0 | 1.0 |
| 45459 | False | 0.0 | NaN | 222848 | tt0112613 | en | Caged Heat 3000 | It's the year 3000 AD. The world's most dangerous women are banished to a remote asteroid 45 million light years from earth. Kira Murphy doesn't belong; wrongfully accused of a crime she did not commit, she's thrown in this interplanetary prison and left to her own defenses. But Kira's a fighter, and soon she finds herself in the middle of a female gang war; where everyone wants a piece of the action... and a piece of her! "Caged Heat 3000" takes the Women-in-Prison genre to a whole new level... and a whole new galaxy! | 0.661558 | 1995-01-01 | 0.0 | 85.0 | Released | NaN | Caged Heat 3000 | False | 3.5 | 1.0 |
| 45460 | False | 0.0 | NaN | 30840 | tt0102797 | en | Robin Hood | Yet another version of the classic epic, with enough variation to make it interesting. The story is the same, but some of the characters are quite different from the usual, in particular Uma Thurman's very special maid Marian. The photography is also great, giving the story a somewhat darker tone. | 5.683753 | 1991-05-13 | 0.0 | 104.0 | Released | NaN | Robin Hood | False | 5.7 | 26.0 |
| 45461 | False | 0.0 | http://www.imdb.com/title/tt6209470/ | 439050 | tt6209470 | fa | رگ خواب | Rising and falling between a man and woman. | 0.072051 | NaT | 0.0 | 90.0 | Released | Rising and falling between a man and woman | Subdue | False | 4.0 | 1.0 |
| 45462 | False | 0.0 | NaN | 111109 | tt2028550 | tl | Siglo ng Pagluluwal | An artist struggles to finish his work while a storyline about a cult plays in his head. | 0.178241 | 2011-11-17 | 0.0 | 360.0 | Released | NaN | Century of Birthing | False | 9.0 | 3.0 |
| 45463 | False | 0.0 | NaN | 67758 | tt0303758 | en | Betrayal | When one of her hits goes wrong, a professional assassin ends up with a suitcase full of a million dollars belonging to a mob boss ... | 0.903007 | 2003-08-01 | 0.0 | 90.0 | Released | A deadly game of wits. | Betrayal | False | 3.8 | 6.0 |
| 45464 | False | 0.0 | NaN | 227506 | tt0008536 | en | Satana likuyushchiy | In a small town live two brothers, one a minister and the other one a hunchback painter of the chapel who lives with his wife. One dreadful and stormy night, a stranger knocks at the door asking for shelter. The stranger talks about all the good things of the earthly life the minister is missing because of his puritanical faith. The minister comes to accept the stranger's viewpoint but it is others who will pay the consequences because the minister will discover the human pleasures thanks to, ehem, his sister- in -law… The tormented minister and his cuckolded brother will die in a strange accident in the chapel and later an infant will be born from the minister's adulterous relationship. | 0.003503 | 1917-10-21 | 0.0 | 87.0 | Released | NaN | Satan Triumphant | False | 0.0 | 0.0 |
| 45465 | False | 0.0 | NaN | 461257 | tt6980792 | en | Queerama | 50 years after decriminalisation of homosexuality in the UK, director Daisy Asquith mines the jewels of the BFI archive to take us into the relationships, desires, fears and expressions of gay men and women in the 20th century. | 0.163015 | 2017-06-09 | 0.0 | 75.0 | Released | NaN | Queerama | False | 0.0 | 0.0 |
Most frequently occurring
| adult | budget | homepage | id | imdb_id | original_language | original_title | overview | popularity | release_date | revenue | runtime | status | tagline | title | video | vote_average | vote_count | # duplicates | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 4 | False | 0.0 | NaN | 141971 | tt1180333 | fi | Blackout | Recovering from a nail gun shot to the head and 13 months of coma, doctor Pekka Valinta starts to unravel the mystery of his past, still suffering from total amnesia. | 0.411949 | 2008-12-26 | 0.0 | 108.0 | Released | Which one is the first to return - memory or the murderer? | Blackout | False | 6.7 | 3.0 | 3 |
| 0 | False | 0.0 | http://www.daysofdarknessthemovie.com/ | 18440 | tt0499456 | en | Days of Darkness | When a comet strikes Earth and kicks up a cloud of toxic dust, hundreds of humans join the ranks of the living dead. But there's bad news for the survivors: The newly minted zombies are hell-bent on eradicating every last person from the planet. For the few human beings who remain, going head to head with the flesh-eating fiends is their only chance for long-term survival. Yet their battle will be dark and cold, with overwhelming odds. | 1.436085 | 2007-01-01 | 0.0 | 89.0 | Released | NaN | Days of Darkness | False | 5.0 | 5.0 | 2 |
| 1 | False | 0.0 | http://www.dealthemovie.com/ | 11115 | tt0446676 | en | Deal | As an ex-gambler teaches a hot-shot college kid some things about playing cards, he finds himself pulled into the world series of poker, where his protégé is his toughest competition. | 6.880365 | 2008-01-29 | 0.0 | 85.0 | Released | NaN | Deal | False | 5.2 | 22.0 | 2 |
| 2 | False | 0.0 | NaN | 105045 | tt0111613 | de | Das Versprechen | East-Berlin, 1961, shortly after the erection of the Wall. Konrad, Sophie and three of their friends plan a daring escape to Western Germany. The attempt is successful, except for Konrad, who remains behind. From then on, and for the next 28 years, Konrad and Sophie will attempt to meet again, in spite of the Iron Curtain. Konrad, who has become a reputed Astrophysicist, tries to take advantage of scientific congresses outside Eastern Germany to arrange encounters with Sophie. But in a country where the political police, the Stasi, monitors the moves of all suspicious people (such as Konrad's sister Barbara and her husband Harald), preserving one's privacy, ideals and self-respect becomes an exhausting fight, even as the Eastern block begins its long process of disintegration. | 0.122178 | 1995-02-16 | 0.0 | 115.0 | Released | A love, a hope, a wall. | The Promise | False | 5.0 | 1.0 | 2 |
| 3 | False | 0.0 | NaN | 119916 | tt0080000 | en | The Tempest | Prospero, the true Duke of Milan is now living on an enchanted island with his daughter Miranda, the savage Caliban and Ariel, a spirit of the air. Raising a sorm to bring his brother - the usurper of his dukedom - along with his royal entourage. to the island. Prospero contrives his revenge. | 0.000018 | 1980-02-27 | 0.0 | 123.0 | Released | NaN | The Tempest | False | 0.0 | 0.0 | 2 |
| 5 | False | 0.0 | NaN | 152795 | tt1821641 | en | The Congress | More than two decades after catapulting to stardom with The Princess Bride, an aging actress (Robin Wright, playing a version of herself) decides to take her final job: preserving her digital likeness for a future Hollywood. Through a deal brokered by her loyal, longtime agent and the head of Miramount Studios, her alias will be controlled by the studio, and will star in any film they want with no restrictions. In return, she receives healthy compensation so she can care for her ailing son and her digitized character will stay forever young. Twenty years later, under the creative vision of the studio’s head animator, Wright’s digital double rises to immortal stardom. With her contract expiring, she is invited to take part in “The Congress” convention as she makes her comeback straight into the world of future fantasy cinema. | 8.534039 | 2013-05-16 | 455815.0 | 122.0 | Released | NaN | The Congress | False | 6.4 | 165.0 | 2 |
| 6 | False | 0.0 | NaN | 159849 | tt0173769 | en | Why We Fight: Divide and Conquer | The third film of Frank Capra's 'Why We Fight" propaganda film series, dealing with the Nazi conquest of Western Europe in 1940. | 0.473322 | 1943-01-01 | 0.0 | 57.0 | Released | NaN | Why We Fight: Divide and Conquer | False | 5.0 | 1.0 | 2 |
| 7 | False | 0.0 | NaN | 168538 | tt0084387 | en | Nana | In Zola's Paris, an ingenue arrives at a tony bordello: she's Nana, guileless, but quickly learning to use her erotic innocence to get what she wants. She's an actress for a soft-core filmmaker and soon is the most popular courtesan in Paris, parlaying this into a house, bought for her by a wealthy banker. She tosses him and takes up with her neighbor, a count of impeccable rectitude, and with the count's impressionable son. The count is soon fetching sticks like a dog and mortgaging his lands to satisfy her whims. | 1.276602 | 1983-06-13 | 0.0 | 92.0 | Released | NaN | Nana, the True Key of Pleasure | False | 4.7 | 3.0 | 2 |
| 8 | False | 0.0 | NaN | 23305 | tt0295682 | en | The Warrior | In feudal India, a warrior (Khan) who renounces his role as the longtime enforcer to a local lord becomes the prey in a murderous hunt through the Himalayan mountains. | 1.967992 | 2001-09-23 | 0.0 | 86.0 | Released | NaN | The Warrior | False | 6.3 | 15.0 | 2 |
| 9 | False | 0.0 | NaN | 25541 | tt1327820 | da | Broderskab | Former Danish servicemen Lars and Jimmy are thrown together while training in a neo-Nazi group. Moving from hostility through grudging admiration to friendship and finally passion, events take a darker turn when their illicit relationship is uncovered. | 2.587911 | 2009-10-21 | 0.0 | 90.0 | Released | NaN | Brotherhood | False | 7.1 | 21.0 | 2 |